Data Warehousing for Association Mining

Author

  • Yuefeng Li
Abstract

Introduction. With the phenomenal growth of electronic data and information, there is strong demand for efficient and effective systems (tools) that perform data mining tasks on data warehouses or multidimensional databases. Association rules describe associations between itemsets (i.e., sets of data items), also called granules. Association mining (also called association rule mining) finds interesting or useful association rules in databases and is a crucial technique in data mining. It can be used in many application areas, for example, discovering associations between customers' locations and shopping behaviours in market basket analysis.

Association mining has two phases. The first phase, pattern mining, discovers frequent patterns. The second phase, rule generation, discovers the interesting and useful association rules in the discovered patterns. The first phase, however, often takes a long time to find all frequent patterns, and those patterns also include much noise (Pei and Han, 2002). The second phase is also time consuming (Han and Kamber, 2000) and can generate many redundant rules (Zaki, 2004; Xu and Li, 2007). To reduce the search space, user constraint-based techniques attempt to find knowledge that meets certain constraints. Two interesting concepts have been used in user constraint-based techniques: metarules (Han and Kamber, 2000) and granule mining (Li et al., 2006).

The aim of this chapter is to present the latest research results on data warehousing techniques that can be used to improve the performance of association mining. The chapter introduces two important approaches based on user constraint-based techniques. The first approach asks users to input metarules that describe their desires for certain data dimensions. It then creates data cubes based on these metarules and provides interesting association rules. The second approach first asks users to provide condition and decision attributes, which are used to describe the antecedent and consequent of rules, respectively. It then finds all possible data granules based on the condition and decision attributes. It also creates a multi-tier structure to store the associations between granules, and association mappings to provide interesting rules.
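
The two phases of association mining described in the abstract can be illustrated with a minimal Python sketch. It is not the chapter's method: it uses brute-force enumeration rather than data cubes or multi-tier granule structures, and the function names (`frequent_itemsets`, `generate_rules`), the thresholds, and the toy location/purchase transactions are all assumptions made for illustration.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Phase 1 (pattern mining): map each frequent itemset to its support."""
    n = len(transactions)
    items = sorted({item for t in transactions for item in t})
    frequent = {}
    for size in range(1, len(items) + 1):
        found = False
        for candidate in combinations(items, size):
            cset = frozenset(candidate)
            # Support = fraction of transactions containing the candidate.
            support = sum(1 for t in transactions if cset <= t) / n
            if support >= min_support:
                frequent[cset] = support
                found = True
        if not found:
            # No frequent itemset of this size, so none of a larger size.
            break
    return frequent


def generate_rules(frequent, min_confidence):
    """Phase 2 (rule generation): derive rules antecedent -> consequent."""
    rules = []
    for itemset, support in frequent.items():
        if len(itemset) < 2:
            continue
        for k in range(1, len(itemset)):
            for antecedent in combinations(sorted(itemset), k):
                a = frozenset(antecedent)
                # Every subset of a frequent itemset is itself frequent.
                confidence = support / frequent[a]
                if confidence >= min_confidence:
                    rules.append((a, itemset - a, support, confidence))
    return rules


# Hypothetical toy data: each transaction holds attribute=value items.
transactions = [
    {"loc=city", "buy=coffee"},
    {"loc=city", "buy=coffee"},
    {"loc=city", "buy=tea"},
    {"loc=rural", "buy=tea"},
]

patterns = frequent_itemsets(transactions, min_support=0.5)
for a, c, s, conf in generate_rules(patterns, min_confidence=0.6):
    print(sorted(a), "->", sorted(c), f"support={s:.2f} confidence={conf:.2f}")
```

Even in this small sketch, the first phase examines every candidate itemset, which is why restricting the search with user constraints (for example, fixing which attributes may appear in the antecedent and which in the consequent) can reduce the work substantially.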

Similar resources

Ontology-Incorporated Mining of Association Rules in Data Warehouse

With the rise of the Internet and the development of various electronic information resources, mining useful information from large databases has become one of the most important issues in information research for users. Data warehousing plays the key role in providing data for data mining tools to explore knowledge. However, there are still many problems that cause users to spend extra time to...

Granule mining oriented data warehousing model for representations of multidimensional association rules

To promise the quality of multidimensional association mining in real applications is a challenging research issue. The challenging issue is how to represent multidimensional association rules efficiently because of the complicated correlation between attributes. Multi-tier granule mining is one initiative in solving this challenge. This paper presents a granule mining oriented data warehousing...

Concepts, challenges, and prospects on Multiagent Data Warehousing (MADWH) and Multiagent Data Mining (MADM)

Multiagent Data Warehousing (MADWH) and Multiagent Data Mining (MADM) presents a multidimensional agent-oriented approach for brain modelling and decision making based on the hypothesis that a brain system consists of a society of semiautonomous neural agents and full autonomy is the result of coordination of semiautonomous functionalities. The agent-oriented approach leads to the following con...

Sequential Pattern Mining

Sequential pattern mining deals with data represented as sequences (a sequence contains sorted sets of items). Compared to the association rule problem, a study of such data provides “inter-transaction” analysis (Agrawal & Srikant, 1995). Applications for sequential pattern extraction are numerous and the problem definition has been slightly modified in different ways. Associated to elegant sol...

Publication date: 2009